Incorporating Biological Domain Knowledge into Cluster Validity Assessment

نویسندگان

  • Nadia Bolshakova
  • Francisco Azuaje
  • Padraig Cunningham
چکیده

This paper presents an approach for assessing cluster validity based on similarity knowledge extracted from the Gene Ontology (GO) and databases annotated to the GO. A knowledge-driven cluster validity assessment system for microarray data was implemented. Different methods were applied to measure similarity between yeast genes products based on the GO. This research proposes two methods for calculating cluster validity indices using GO-driven similarity. The first approach processes overall similarity values, which are calculated by taking into account the combined annotations originating from the three GO hierarchies. The second approach is based on the calculation of GO hierarchyindependent similarity values, which originate from each of these hierarchies. A traditional node-counting method and an information content technique have been implemented to measure knowledge-based similarity between genes products (biological distances). The results contribute to the evaluation of clustering outcomes and the identification of optimal cluster partitions, which may represent an effective tool to support biomedical knowledge discovery in gene expression data analysis.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incorporating Prior Knowledge into Extension Neural Network and its Application to Recognition of Safety Status Pattern of Coal Mines

Incorporating prior knowledge (PK) into learning methods is an effective means to improve learning performance. On the bases of requirements of engineering practice and the characteristics of knowledge representation of extension neural network (ENN), with the purpose of further improving the performance of ENN in engineering practice, a prior-knowledge-based ENN (PKENN) recognition method is p...

متن کامل

Validity and reliability of the Persian version of Dementia Knowledge Assessment Scale and Dementia Attitude Scale in Iranian health workers about Alzheimer’s disease

Abstract Introduction: There are various questionnaires in the world to assess the level of knowledge and attitudes of people about Alzheimer’s disease. Dementia Knowledge Assessment Scale (DKAS) and Dementia Attitude Scale (DAS) questionnaires are two well-known questionnaires in this field, respectively. The purpose of this study is translation and psychometrics evaluation of two questionnair...

متن کامل

Clustering Hand-Drawn Sketches via Analogical Generalization

One of the major challenges to building intelligent educational software is determining what kinds of feedback to give learners. Useful feedback makes use of models of domain-specific knowledge, especially models that are commonly held by potential students. To empirically determine what these models are, student data can be clustered to reveal common misconceptions or common problem-solving st...

متن کامل

A knowledge-driven approach to cluster validity assessment

UNLABELLED This paper presents an approach to assessing cluster validity based on similarity knowledge extracted from the Gene Ontology. AVAILABILITY The program is freely available for non-profit use on request from the authors.

متن کامل

Incorporating Prior Domain Knowledge Into Inductive Supervised Machine Learning Incorporating Prior Domain Knowledge Into Inductive Machine Learning

The paper reviews the recent developments of incorporating prior domain knowledge into inductive machine learning, and proposes a guideline that incorporates prior domain knowledge in three key issues of inductive machine learning algorithms: consistency, generalization and convergence. With respect to each issue, this paper gives some approaches to improve the performance of the inductive mach...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006